Exact Dependence Analysis for Increased Communication Overlap

Authors

  • Simone Pellegrini
  • Torsten Hoefler
  • Thomas Fahringer
Abstract

MPI programs are often challenged to scale up to several million cores. To get there, the programmer tunes every aspect of the application code. For large applications, however, this is often impractical, and expensive tracing tools and post-mortem analysis are employed to guide the tuning effort by finding hot spots and performance bottlenecks. In this paper we revive the use of compiler analysis techniques to automatically unveil opportunities for communication/computation overlap, using the results of the exact data dependence analysis provided by the polyhedral model. We apply our technique to a 5-point stencil code and show performance improvements of up to 28% on 512 cores.
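The overlap opportunity described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' polyhedral analysis and it uses no MPI: the halo exchange is simulated by a hypothetical `wait_for_halo` callback, and all function names are assumptions made for illustration. The sketch only shows the underlying dependence fact for a 5-point stencil with a halo of width 1: cells at least two rows and columns away from the edge of the local array never read halo data, so they could be updated while the exchange is still in flight.

```python
def stencil_point(grid, i, j):
    """5-point stencil: average of a cell and its four direct neighbours."""
    return (grid[i][j] + grid[i - 1][j] + grid[i + 1][j]
            + grid[i][j - 1] + grid[i][j + 1]) / 5.0


def sweep_reference(full):
    """One sweep over all owned cells, with the halo already filled in."""
    n, m = len(full), len(full[0])
    return [[stencil_point(full, i, j) for j in range(1, m - 1)]
            for i in range(1, n - 1)]


def sweep_overlapped(local, wait_for_halo):
    """Same sweep, split into a halo-independent and a halo-dependent phase.

    `local` holds None in its halo cells (data not received yet).
    `wait_for_halo` is a hypothetical stand-in for completing the exchange;
    it returns the grid with the halo filled in.
    """
    n, m = len(local), len(local[0])
    out = [[None] * (m - 2) for _ in range(n - 2)]

    # Phase 1: these cells read only owned data, so this loop could run
    # while the halo exchange is in progress. Touching a None halo cell
    # here would raise a TypeError, which makes the independence explicit.
    for i in range(2, n - 2):
        for j in range(2, m - 2):
            out[i - 1][j - 1] = stencil_point(local, i, j)

    # Phase 2: the outermost owned ring needs the halo, so wait for it now.
    full = wait_for_halo(local)
    for i in range(1, n - 1):
        for j in range(1, m - 1):
            if i in (1, n - 2) or j in (1, m - 2):
                out[i - 1][j - 1] = stencil_point(full, i, j)
    return out


def make_example(n=6, m=7):
    """Build a filled grid and a copy whose halo cells are still missing."""
    full = [[float(i * m + j) for j in range(m)] for i in range(n)]
    local = [[None if i in (0, n - 1) or j in (0, m - 1) else full[i][j]
              for j in range(m)] for i in range(n)]
    return full, local
```

In a real MPI code, phase 1 would sit between posting the nonblocking sends and receives and waiting on them; here both sweeps produce identical owned cells, which is the property a dependence analysis would have to establish before reordering.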

Similar resources

Asymptotic Analysis of Binary Gas Mixture Separation by Nanometric Tubular Ceramic Membranes: Cocurrent and Countercurrent Flow Patterns

Analytical gas-permeation models for predicting the separation process across membranes (exit compositions and area requirement) constitute an important and necessary step in understanding the overall performance of membrane modules. However, the exact (numerical) solution methods suffer from the complexity of the solution. Therefore, solutions of nonlinear ordinary differential equations th...

Relationship between dependence on modern communication tools and psychological well-being in students

Nowadays, due to technological advances and their direct impact on the quality of human life, addressing the issue of health and its determinants is very important. Maintaining public health is one of the most fundamental issues in each community. The aim of the study was to survey the relationship between dependence on modern communication tools (internet and mobile phones) and psychological well-bein...

Exact and Efficient Computation of the Expected Number of Missing and Common Words in Random Texts

The number of missing words (NMW) of length q in a text, and the number of common words (NCW) of two texts are useful text statistics. Knowing the distribution of the NMW in a random text is essential for the construction of so-called monkey tests for pseudorandom number generators. Knowledge of the distribution of the NCW of two independent random texts is useful for the average case analysis ...

Toucan - A Translator for Communication Tolerant MPI Applications

We discuss early results with Toucan, a source-to-source translator that automatically restructures C/C++ MPI applications to overlap communication with computation. We co-designed the translator and runtime system to enable dynamic, dependence-driven execution of MPI applications, and require only a modest amount of programmer annotation. Co-design was essential to realizing overlap through dyn...

Recent Results Using the Overlap Dirac Operator

The overlap Dirac operator, derived from the overlap formalism for the special case of vector gauge theories, is a way to realize exact chiral symmetry on the lattice. Exact chiral symmetry on the lattice does come at a price: numerical implementation of the overlap Dirac operator is significantly more expensive than that of the Wilson or staggered operators. In spite of this numerical hurdle, we already h...

Publication date: 2012